[BugFix]fix Qwen3 MoE call gate twice by jikunshang · Pull Request #40664 · vllm-project/vllm

jikunshang · 2026-04-23T01:04:55Z

Purpose

Qwen3 MoE model will call gate gemm twice, we find this in xpu kernel profiling. thanks @zufangzhu raise this.
see discussion here #35326 (comment)
This PR just follow deepseek_v2.py and other modeling file use is_internal_router
Moreover, there may be some dead code, will address in follow up PRs.

cc @robertgshaw2-redhat @bnellnm

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

This pull request updates the forward method in vllm/model_executor/models/qwen3_moe.py to support internal routing within the FusedMoE class. It introduces a conditional check to determine whether to use an internal router or an external gate for computing router logits, providing flexibility for different MoE implementations. I have no feedback to provide as there were no review comments.

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Avinash Singh <avinashsingh.rcoem@gmail.com>

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Adrian <info@zzit.ch>

fix Qwen3 MoE call gate twice

2fed0ff

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>

jikunshang requested review from sighingnow and vadiklyutiy as code owners April 23, 2026 01:04

claude Bot reviewed Apr 23, 2026

View reviewed changes

mergify Bot added qwen Related to Qwen models bug Something isn't working labels Apr 23, 2026

robertgshaw2-redhat approved these changes Apr 23, 2026

View reviewed changes

robertgshaw2-redhat enabled auto-merge (squash) April 23, 2026 01:08

github-actions Bot added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 23, 2026

gemini-code-assist Bot reviewed Apr 23, 2026

View reviewed changes

xinyu-intel approved these changes Apr 23, 2026

View reviewed changes

bnellnm approved these changes Apr 23, 2026

View reviewed changes

robertgshaw2-redhat merged commit 342c58b into vllm-project:main Apr 23, 2026
62 checks passed

avinashsingh77 pushed a commit to avinashsingh77/vllm that referenced this pull request Apr 27, 2026

[BugFix]fix Qwen3 MoE call gate twice (vllm-project#40664)

18ad1cf

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Avinash Singh <avinashsingh.rcoem@gmail.com>

Lafunamor pushed a commit to Lafunamor/vllm that referenced this pull request May 1, 2026

[BugFix]fix Qwen3 MoE call gate twice (vllm-project#40664)

b15bb46

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Adrian <info@zzit.ch>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BugFix]fix Qwen3 MoE call gate twice#40664

[BugFix]fix Qwen3 MoE call gate twice#40664
robertgshaw2-redhat merged 1 commit intovllm-project:mainfrom
jikunshang:kunshang/moe_gate

jikunshang commented Apr 23, 2026 •

edited

Loading

Uh oh!

claude Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

jikunshang commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jikunshang commented Apr 23, 2026 •

edited

Loading